Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation

نویسندگان

چکیده

In this paper, we propose a novel text-based talking-head video generation framework that synthesizes high-fidelity facial expressions and head motions in accordance with contextual sentiments as well speech rhythm pauses. To be specific, our consists of speaker-independent stage speaker-specific stage. the stage, design three parallel networks to generate animation parameters mouth, upper face, from texts, separately. present 3D face model guided attention network synthesize videos tailored for different individuals. It takes input exploits an mask manipulate expression changes Furthermore, better establish authentic correspondences between visual (i.e., movements) audios, leverage high-accuracy motion capture dataset instead relying on long specific After attaining audio correspondences, can effectively train end-to-end fashion. Extensive experiments qualitative quantitative results demonstrate algorithm achieves high-quality photo-realistic including various according rhythms outperforms state-of-the-art.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Talking Head System for Korean Text

A talking head system (THS) is presented to animate the face of a speaking 3D avatar in such a way that it realistically pronounces the given Korean text. The proposed system consists of SAPI compliant text-to-speech (TTS) engine and MPEG-4 compliant face animation generator. The input to the THS is a unicode text that is to be spoken with synchronized lip shape. The TTS engine generates a phon...

متن کامل

A Text Based Talking Face

Facial expressions and speech are means to convey information. They can be used to reinforce speech or even complementary to speech. The main goal of our research is to investigate how facial expressions can be associated to textbased speech in an automated way. As a first step we studied how people attach smileys to text in chat sessions and facial expressions to text balloons in cartoons. We ...

متن کامل

Text Driven 3D Photo-Realistic Talking Head

We propose a new 3D photo-realistic talking head with a personalized, photo realistic appearance. Different head motions and facial expressions can be freely controlled and rendered. It extends our prior, high-quality, 2D photo-realistic talking head to 3D. Around 20-minutes of audio-visual 2D video are first recorded with read prompted sentences spoken by a speaker. We use a 2D-to-3D reconstru...

متن کامل

LuciawebGL: a new WebGL-Based talking head

In this DEMO we present the first worldwide WebGL implementation of a talking head (LuciaWebGL), and also the first WebGL talking head running on iOS mobile devices (Apple iPhone and iPad).

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2021

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v35i3.16286